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Abstract 

Acquisition of Elizabethkingia infections in intensive care units (ICUs) has risen in the past decade. Treatment of Elizabethkingia 
infections is challenging due to the lack of effective therapeutic regimens, leading to a high mortality rate. Elizabethkingia infections 
have long been attributed to Elizabethkingia meningoseptica. Recently, we used whole-genome sequencing to reveal that 
E. anophelis is the pathogenic agent for an Elizabethkingia outbreak at two ICUs. We performed comparative genomic analysis of 
seven hospital-isolated E. anophelis strains with five available Elizabethkingia spp. genomes deposited in the National Center for 
Biotechnology Information Database. A pan-genomic approach was applied to identify the core- and pan-genome for the 
Elizabethkingia genus. We showed that unlike the hospital-isolated pathogen E. meningoseptica ATCC 12535 strain, the hospital- 
isolated £ anophelis strains have genome content and organization similar to the E. anophelis Ag1 and R26 strains isolated from the 
midgut microbiota of the malaria mosquito vector Anopheles gambiae. Both the core- and accessory genomes of Elizabethkingia spp. 
possess genes conferring antibiotic resistance and virulence. Our study highlights that £ anophelis is an emerging bacterial pathogen 
for hospital environments. 
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Introduction 

Elizabethkingia is a genus of aerobic, nonmotile, Gram-neg- 
ative rods that is ubiquitous in nature. Members of this ge- 
nus thrive in wet habitats and hospital settings, in particular 
water supplies and saline flushing solutions. Among the 
Elizabethkingia spp., the species Elizabethkingia meningo- 
septica is well established as a serious causative agent of 
neonatal meningitis and sepsis (Dooley et al. 1980), and a 
notable rise in £ meningoseptica nosocomial infections has 
been recorded in recent years. Treatment of £ meningosep- 
tica infections is notoriously difficult, and there is a lack of 



effective specific therapeutic regimens (Hsu et al. 201 1). The 
mortality rate of nosocomial infections caused by £ menin- 
goseptica can reach as high as 52% in neonates (Bloch 
et al. 1997) and ranges from 23% (Teres 1974) to 33% 
(Bloch et al. 1997) in nonneonates. Hence, the acquisition of 
£ meningoseptica in intensive care units (ICUs) is used as a 
significant predictor of mortality (Teres 1974). Except for 
£ meningoseptica, it is rarely reported that other 
Elizabethkingia species can cause infections although £ mir- 
icola has been reported to be associated with sepsis (Green 
et al. 2008). 
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Fig. 1. — A phylogenetic tree showing the 1 2 Elizabethkingia spp. This phylogenetic tree was produced by pair-wise genome comparisons by Progressive 
Mauve. 
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Fig. 2. — SNP distance matrix among the 12 Elizabethkingia spp. SNP difference between each pair of Elizabethkingia spp. was calculated by using the 
snpTree web server. 
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Fig. 3. — Sequence comparison by alignment. The Elizabethkingia anophelis NUHP1 strain (red arc) was aligned against: E. anophelis R26 strain (blue arc), 
E. anophelis Ag1 (orange arc), E. meningoseptica ATCC 1 2535 (NITE) (yellow arc), and E. meningoseptica 502 (light green arc). Colored links between contigs 
represent homologous regions spanning: 1 0 2 -1 0 3 bp (green), 1 0 3 -1 0 4 bp (blue), and above 1 0 4 bp (red). The presence of red links between the NUHP1 strain 
and E. anophelis R26, Ag1 contigs, and E. meningoseptica 502 contigs indicate a high degree of similarity between these genomes. 



The limited genomic information available for 
Elizabethkingia hinders our understanding of the virulence 
mechanisms and molecular epidemiology of its member 
species. Recently, we used whole-genome sequencing to 
investigate an Elizabethkingia outbreak in two ICUs at 
the National University Hospital, Singapore (Balm et al. 
2013). All the patient-associated Elizabethkingia strains 
and hand hygiene sink aerator-associated Elizabethkingia 



strains were isolated within a 1 -month period at the 
ICUs. We found the outbreak agent to be a novel spe- 
cies — E. anophelis (Teo et al. 2013), usually found in the 
midgut microbiota of the malaria mosquito vector 
Anopheles gambiae. Here, we report comparative genomic 
analysis on the Elizabethkingia spp. to investigate their 
mechanism for virulence, stress response, and niche 
adaptation. 
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General Characteristics of the Outbreak E. anophelis 
Strains 

The general genomic characteristics of the seven 
Elizabethkingia spp. strains derived from this study, three pa- 
tient (NUHP1, NUHP2, and NUHP3) and four sink isolates 
(NUH1, NUH4, NUH6, and NUH1 1), obtained from the 
RAST server (Aziz et al. 2008) are presented in supplementary 
table S1, Supplementary Material online. 

The Whole-Genome Shotgun (WGS) sequences of five 
previously sequenced E. anophelis strains Ag1 (Bioproject ac- 
cession number, PRJNA80705) and R26 (Bioproject accession 
number, PRJNA178189), E. meningoseptica ATCC 12535 
(NITE) (Bioproject accession number, PRJNA1 99489), 
E. meningoseptica ATCC 12535 (OSU) (Bioproject accession 
number, PRJNA198814), and E. meningoseptica 502 
(Bioproject accession number, PRJNA176121) were also sub- 
mitted to the RAST server; supplementary table S2, 
Supplementary Material online, shows the general character- 
istics of these strains. The genome sizes and GC content of the 
hospital-isolated strains are similar to the other E. anophelis 
strains and one of the E. meningoseptica strains (502) (sup- 
plementary tables S1 and S2, Supplementary Material online). 

Comparative Genomic Analysis 

Multiple genome alignment was performed by using 
Progressive Mauve (Darling et al. 2010) to compare all the 
genomes of the Elizabethkingia spp. The phylogenetic tree 
based on the multiple genome alignment showed that 
E. anophelis and E. meningoseptica genomes belong to dis- 
tinct groups (fig. 1). Single-nucleotide polymorphism (SNP) 
differences between each pair of Elizabethkingia spp. was 
calculated using the snpTree web server (Leekitcharoenphon 
et al. 2012) and shown in figure 2. These results showed that 
the genomes of the patient isolates from the current outbreak 
(NUHP1 , NUHP2, and NUHP3) were very similar to each other, 
NUH1 and NUH4, which suggests that the E. anophelis strains 
NUHP1, NUHP2, NUHP3, NUH1, and NUH4 may be clonal. 
The NUH6 and NUH1 1 strains may exist as a separate clone as 
they were found to have high-genomic similarity to each other 
but not to the other hospital isolates. The E. anophelis Ag1 
and R26 strains formed a distinct group in the phylogenetic 
tree. Also, the E. meningoseptica ATCC 12535 (NITE) and 
ATCC 1 2535 (OSE) strains formed a distinct group in the phy- 
logenetic tree, whereas the E. meningoseptica 502 formed yet 
another distinct group. 

To reach a high-resolution comparison, multiple whole-ge- 
nome sequence alignments were performed with CIRCOS 
v0.64 (Krzywinski et al. 2009), using the E. anophelis 
NUHP1 strain, E. anophelis Ag1 and R26 strains, and the 
E. meningoseptica ATCC 12535 (NITE) and 502 strains 
(fig. 3). The red links show a high number of large homolo- 
gous regions (longer than 10,000 nucleotides) between 
E. anophelis NUHP1 strain and the E. anophelis Ag1 strain, 
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Fig. 4. — Curves for the core-genomes, pan-genomes, and numbers 
of new genes of the 12 Elizabethkingia spp. 



E. anophelis R26 strain, and the E. meningoseptica 502 strain. 
The genome of the E. meningoseptica ATCC 12535 (NITE) is 
less close to the E. anophelis NUHP1 (fig. 3). This result corre- 
lates well with the phylogenetic tree result (fig. 1) and GC 
content (supplementary tables S1 and S2, Supplementary 
Material online) of the genomes of these Elizabethkingia spp. 

Core/Pan-Genomic Analysis of Elizabethkingia spp. 

We performed core/pan-genomic analysis of the 
Elizabethkingia spp. The results of all permutations of the 
order of addition for each of the 12 genomes are presented 
in figure 4. As expected, the number of genes in the core- 
genome initially decreases and that of the pan-genome ini- 
tially increases with addition of each new genome sequence. 
Extrapolation of the curve indicates that the core-genome 
reaches a minimum of 2,589 "core" genes (fig. 4). In accor- 
dance with the core-genome size, the pan-genome reaches a 
maximum of 5,575 genes in total across 12 genomes (fig. 4). 

Resistance and Virulence Profile of the Elizabethkingia 
spp. 

The core/accessory genomes were searched against the 
Comprehensive Antibiotic Resistance Database (McArthur 
et al. 2013) and Virulence Factors of Pathogenic bacteria 
Database (VFDB) (Chen et al. 201 2) to identify antibiotic resis- 
tant genes and virulence genes. Thirty percentage identity and 
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expectation value < 1e-5 were used as a threshold when per- 
forming the BLASTP searches because the genomes of 
Elizabethkingia spp. are very new and highly likely to not 
have been included in any of these databases before. 
Sixteen and 19 antibiotic-resistant genes were identified 
from the core- and accessory genomes of Elizabethkingia 
spp., respectively, which cover genes conferring resistance 
to aminoglycosides, beta-lactamases, fluoroquinolones, glyco- 
peptides, macrolide-lincosamide-streptogramin, tetracyclines, 
trimethoprim, and rifampicin (table 1). These genes correlate 
with the reported antibiotic resistant profiles of 
Elizabethkingia spp. (Hsu et al. 2011). The patient-isolated 
NUHP1, NUHP2, and NUHP3 strains are resistant to tetracy- 
cline (minimum inhibitory concentration, MIC = 64|ig/ml), 
ciprofloxacin (MIC = 1 6 jig/ml), erythromycin 

(MIC > 512|ig/ml), ceftazidime (MIC = 64 jig/ml), tobramycin 
(MIC > 512(ig/ml), and vancomycin (MIC = 32 jig/ml). Some 
of the antibiotic resistance genes might be nonfunctional. A 
complete genome sequence is required for intact analysis of 
these resistance gene operons. 

In our study, 146 and 70 virulence genes were identified 
from the core- and accessory genomes of Elizabethkingia spp., 
respectively, which include genes involved in lipopolysaccha- 
ride biosynthesis, iron siderophore synthesis, heme uptake, 
transposase synthesis, alginate synthesis, and so on (supple- 
mentary table S3, Supplementary Material online). 

Functional Classification of Genes Only Belongs to 
E. meningoseptica or £ anophelis 

To gain knowledge about the difference in metabolic capacity 
between E. meningoseptica and E. anophelis, we enriched 
genes that exist only in E. meningoseptica but not in 
E. anophelis (referred to as EM only) and genes that exist 
only in E. anophelis but not in E. meningoseptica (EA only) 
based on BLAST analysis. There were 842 genes unique to the 
five E. meningoseptica genomes and 1,416 genes unique to 
the seven E. anophelis genomes. These unique genes were 
then classified according to their predicted functional role 
(fig. 5). When compared with the EA-only genes, the EM- 
only genes are enriched in predicted proteins belonging to 
Clusters of Orthologous Group (COG) category G (carbohy- 
drate transport and metabolism), H (coenzyme transport and 
metabolism), I (lipid transport and metabolism), K (transcrip- 
tion), R (general function prediction only), S (function un- 
known), and T (signal transduction mechanisms) (fig. 5). 
Conversely, the EA-only genes are enriched in C (energy pro- 
duction and conversion), L (replication, recombination, and 
repair), and P (inorganic ion transport and metabolism) (fig. 5). 

In conclusion, our study revealed the emergence of a 
novel pathogen E. anophelis in the hospital environment. 
Elizabethkingia anophelis is well known to be a dominant spe- 
cies in the gut microbiota of the malaria mosquito 
vector A. gambiae (Dong et al. 2009; Boissiere et al. 2012; 
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Fig. 5. — The identified EM-only and EA-only proteins were assigned to Clusters of Orthologous Groups (COGs). They axis indicates the percentage of 
genes in a specific function cluster out of the total numbers of EM-only and EA-only proteins, respectively. *The abundances of specific function clusters were 
compared statistically as described in Rodriguez-Brito et al. (2006) and Allen et al. (2009) using a subsample size of 500 and 1,000 bootstrap replicates at a 
statistical confidence of 99%. 



Osei-Poku et al. 2012). Elizabethkingia species in the midgut 
of the malaria vector may modulate the anti-Plasmodium ef- 
fects of the host's immune genes, thus prolonging the life 
span of the PlasmodiumAnfecXed malaria mosquito vector 
(Dong et al. 2009). The genome content and organization 
of E. anophelis is similar to, yet distinct from the well-known 
E. meningoseptica, which often causes high mortality among 
hospital acquired infections. Our study suggests that the mos- 
quito vector might be a potential mobile reservoir of antibiotic 
and virulence genes for emerging bacterial pathogens. 
However, we should notice that the core- and pan-genome 
analysis is based on only a small group of genomes. More 
genomes of the Elizabethkingia spp. are required for sophis- 
ticated comparative genomic analysis. Further studies will be 
carried out to comparatively investigate the pathogenesis 
mechanisms employed by E. anophelis and E. meningoseptica 
in causing human infections. 

Materials and Methods 

Ethics Statement 

Ethical approval was not required for the study because it was 
done as part of surveillance and management of healthcare- 
associated infection. 



Genome Sequencing, Assembly, and Comparative 
Genomic Analyses 

The genomes of seven Elizabethkingia spp. strains: Three pa- 
tient isolates (NUHP1 , NUHP2, and NUHP3) and four environ- 
mental isolates (NUH1, NUH4, NUH6, and NUH11) were 
sequenced in this study. Whole-genome DNA of these E. ano- 
phelis strains were purified using QIAamp DNA Mini Kit 
(QIAgen) and sequenced on an lllumina MiSeq platform gen- 
erating 1 50-bp-long paired-end reads. Reads were assembled 
into contigs using de novo assembly in CLCBio's Genomics 
Workbench NGS suite (CLCBio, version 6.0.3) with default 
settings. Average genomic coverages across the strains 
ranged from 111 to 160-fold. The assembled genomes 
were compared with the five available Elizabethkingia spp. 
genomes deposited in the National Center for 
Biotechnology Information (NCBI) Database (supplementary 
table S2, Supplementary Material online). 

Multiple genome alignment was used to compare the ge- 
nomes of the Elizabethkingia spp. by using Progressive Mauve 
with match seed weight 15, min Locally Collinear Block 
weight45, minimum island size 50, maximum backbone 
gap size 50, and minimum backbone size 50 (Darling et al. 
2010). Phylogenetic tree diagrams were prepared using 
the software FigTree ver 1.4.0 (http://tree.bio.ed.ac.uk/soft 
ware/figtree/, last accessed May 9, 2014). SNP differences 
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between each pair of Elizabethkingia spp. was calculated 
using the snpTree web server for assembled genomes with 
minimum coverage 10 and minimum distance between SNPs 
10 (Leekitcharoenphon et al. 2012). Contigs of the NUHP1 
genome were aligned with the three other Elizabethkingia 
species to identify homologous sequences (above 100 bp in 
length and at least 80% homology), using BLASTn as imple- 
mented in NCBI BLAST+ v. 2.2.8 (Camacho et al. 2009). A 
visual representation linking homologous regions between 
NUHP1 and the three strains was constructed with CIRCOS 
v. 0.64 (Krzywinski et al. 2009). 

Core/Pan-Genomic Analysis 

Core- and pan-genomes were calculated, and curves for pan- 
genome and core-genome were generated by the CMG-bio- 
tools package with BLAST cutoff of 50% identity and 50% 
coverage of the longest gene (Vesth et al. 2013). The identi- 
fied core- and accessory genomes were BLAST searched 
against the Comprehensive Antibiotic Resistance Database 
(McArthur et al. 201 3) and VFDB (Chen et al. 201 2) to identify 
antibiotic resistant genes and virulence genes by using Bio-Edit 
(Ibis Biosciences, Carlsbad, CA, http://www.mbio.ncsu.edu/ 
bioedit/bioedit.html, last accessed May 9, 2014) (minimum 
30% identity with E value < 1e" 5 ). 

EM-only proteins and EA-only proteins were enriched by 
the CMG-biotools package (Vesth et al. 2013). The identified 
EM-only proteins and EA-only proteins were assigned to 
COGs, and their abundances compared statistically as de- 
scribed in Rodriguez-Brito et al. (2006) and Allen et al. 
(2009) using a subsample size of 500 and 1,000 bootstrap 
replicates at a statistical confidence of 99%. 

Nucleotide Sequence Accession Numbers 

Each of the seven outbreak E. anophelis genomes were de- 
posited at DDBJ/EMBIVGenBank as individual WGS biopro- 
jects. The respective accession numbers of the seven 
E. anophelis genomes are NUHP1 (ASYE00000000), 
NUHP2 (ASYF00000000), NUHP3 (ASYG00000000), NUH1 
(ASYH00000000), NUH4 (ASYI00000000), NUH6 
(ASYJ00000000), NUH11 (ASYK00000000), respectively. 

Supplementary Material 

Supplementary tables S1-S3 are available at Genome Biology 
and Evolution online (http://www.gbe.oxfordjournals.org/). 
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